Valid Probabilistic Predictions for Ginseng with Venn Machines Using Electronic Nose
Abstract
In electronic nose (E-nose) applications, probabilistic prediction is a good way to estimate how confident we can be in each prediction. In this work, a homemade E-nose system with 16 metal-oxide semiconductor gas sensors was used to discriminate nine kinds of ginseng that differ in species or place of production. A flexible machine learning framework, the Venn machine (VM), was introduced to attach a probability estimate to each prediction. Three Venn predictors were developed, each based on a classical probabilistic prediction method (Platt's method, Softmax regression and Naive Bayes). The three Venn predictors and the three classical methods were compared in terms of classification rate and, especially, the validity of the estimated probabilities. The best classification rate, 88.57%, was achieved with Platt's method in offline mode; the classification rate of VM-SVM (the Venn machine based on a Support Vector Machine) was 86.35%, just 2.22 percentage points lower. The Venn predictors produced more valid probability estimates than the corresponding classical methods, and VM-SVM was superior to all the other methods in validity. The results demonstrate that the Venn machine is a flexible tool for making precise and valid probabilistic predictions in E-nose applications, and that VM-SVM achieved the best performance for the probabilistic prediction of ginseng samples.
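The general Venn machine procedure mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the nearest-centroid rule below is only a stand-in for the underlying classifiers named in the abstract (SVM with Platt's method, Softmax regression, Naive Bayes), and the function names `centroid_label` and `venn_predict`, as well as the toy data, are hypothetical. For each candidate label, the test object is added to the training set with that label, every example is assigned to a category by the underlying classifier, and the label frequencies within the test object's category form one row of a probability matrix. The column-wise minimum and maximum then give a probability interval for each label.

```python
from collections import Counter


def centroid_label(x, examples):
    """Stand-in underlying classifier: label of the nearest class centroid."""
    sums, counts = {}, Counter()
    for features, label in examples:
        acc = sums.setdefault(label, [0.0] * len(features))
        for i, value in enumerate(features):
            acc[i] += value
        counts[label] += 1

    def squared_distance(label):
        return sum((s / counts[label] - v) ** 2 for s, v in zip(sums[label], x))

    return min(sorted(sums), key=squared_distance)


def venn_predict(train, x, labels):
    """Return (predicted label, (lower, upper) probability bounds) for x."""
    rows = {}
    for y in labels:
        # Tentatively give the test object the label y.
        extended = train + [(x, y)]
        # Taxonomy: an example's category is the label the classifier assigns it.
        category = centroid_label(x, extended)
        members = [lab for feats, lab in extended
                   if centroid_label(feats, extended) == category]
        freq = Counter(members)
        # Empirical label distribution inside the test object's category.
        rows[y] = {k: freq[k] / len(members) for k in labels}
    lower = {k: min(rows[y][k] for y in labels) for k in labels}
    upper = {k: max(rows[y][k] for y in labels) for k in labels}
    # Predict the label whose lower probability bound is largest.
    best = max(labels, key=lambda k: lower[k])
    return best, (lower[best], upper[best])


# Toy one-dimensional data (hypothetical, not E-nose measurements).
toy_train = [((0.0,), "a"), ((0.1,), "a"), ((0.2,), "a"),
             ((1.0,), "b"), ((1.1,), "b"), ((1.2,), "b")]
print(venn_predict(toy_train, (0.05,), ["a", "b"]))  # ('a', (0.75, 1.0))
```

The width of the returned interval reflects how much the estimate depends on the unknown true label of the test object; with more training data per category the interval typically narrows.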
Similar resources
Discrimination of American ginseng and Asian ginseng using electronic nose and gas chromatography–mass spectrometry coupled with chemometrics
Background: American ginseng (Panax quinquefolius L.) and Asian ginseng (Panax ginseng Meyer) products, such as slices, have a similar appearance but significantly different prices, leading to widespread adulteration in the commercial market. Their aroma characteristics are attracting increasing attention and are thought to be effective and nondestructive markers to determine adulter...
Multiprobabilistic Venn Predictors with Logistic Regression
This paper describes the methodology of providing multiprobability predictions for proteomic mass spectrometry data. The methodology is based on a newly developed machine learning framework called Venn machines. They allow us to output a valid probability interval. We apply this methodology to mass spectrometry data sets in order to predict the diagnosis of heart disease and early diagnoses of ...
Reliable Probabilistic Prediction for Medical Decision Support
A major drawback of most existing medical decision support systems is that they do not provide any indication about the uncertainty of each of their predictions. This paper addresses this problem with the use of a new machine learning framework for producing valid probabilistic predictions, called Venn Prediction (VP). More specifically, VP is combined with Neural Networks (NNs), which is one o...
Reliable Probability Estimates Based on Support Vector Machines for Large Multiclass Datasets
Venn Predictors (VPs) are machine learning algorithms that can provide well calibrated multiprobability outputs for their predictions. The only drawback of Venn Predictors is their computational inefficiency, especially in the case of large datasets. In this work, we propose an Inductive Venn Predictor (IVP) which overcomes the computational inefficiency problem of the original Venn Prediction ...